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Response Under 37 CFR §1.111 

IN THE CLAIMS 



Please amend the claims as follows. 



Claim 1 (currently amended): A method for classifying electronically posted documents, 
the method comprising: 

receiving a first document and a second document; 

generating a first metadata summary corresponding to said first document and a 
second metadata summary corresponding to the second document, wherein the first 
metadata summary includes a first summary sub-tree and the second metadata 
summary includes a second sub-tree and wherein a sub-tree includes a plurality of list 
items : 

comparing the s tructur e list items of the first summary sub-tree with the s tructur e 
list items of the second summary sub-tree; and 

identifying the first and second documents as distinct if the s tructur e s list items of 
the first and second summary sub-trees are not equivalent. 

Claim 2 (currently amended): The method of claim 1 , wherein tho first oummary sub - 
tree each list item includes at least one attribute having a first an attribute value, a«4 
wh e r ei n - tho socond summary sub-tr eo- i nc l ud e s at le ast ono attributo having a se cond 
attribut e v al u e , the method further comprising: 

comparing for e ach of th e a t -l oast on e attr i bute s , tho f i rst and s e cond - attribut e 
values -the attribute value of a list item of the first summary sub-tree with the attribute 
value of a list item of the second summary sub-tree ; and 

identifying the first and second documents as distinct if the attribute values of the 
first and second summary sub-trees are not equivalent. 

Claim 3 (currently amended): The method of claim 1, wherein a list item of the first 
summary sub-tree includes text content, and wherein a list item of the second summary 
sub-tree includes text content the method comprising: 
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Response Under 37 CFR §1.111 

comparing the text content included within the first and second summary sub- 
trees; and 

identifying the first and second documents as distinct if the text content of the 
first and second summary sub-trees are not equivalent. 

Claim 4 (currently amended): The method of claim 2, wherein a list item of the first 
summary sub-tree further includes text content, and wherein a list item of the second 
summary sub-tree includes text content, the method further comprising: 

comparing the text content included within the first and second summary sub- 
trees; and 



identifying the first and second documents as distinct if the text content included 



Claim 5 {currently amended): The method of claim 4, further comprising identifying the 
first and second documents as duplicates if the text content within the list items of the 
first and second summary sub-trees are equivalent. 

Claim 6 (currently amended): The method of claim 5, further comprising removing the 
second metadata summary from th e fir o t c ummary group if the structures of the first 
and second summary sub-trees are equivalent and i f th o fir s t summary va l ue i s 
oqu i val e nt to tho second summary valuo for e ach of tho at l oaot on e attr i butes . 

Claim 7 (currently amended): The method of claim 1, further comprising: 
defining a first equivalence metadata table comprising: 

a first row corresponding to the first metadata summary; 

a second row corresponding to the second metadata summary; 

a first column corresponding to the first metadata summary; and 

a second column corresponding to the second metadata summary, 
wherein the process of identifying the first and second documents as distinct if the 
structur e s list items of the first and second summary sub-trees are not equivalent 
comprises storing a zero binary value in the first row and second column position of the 




within the first and second summary sub-trees are not equivalent. 
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first equivalence metadata table summary . 

Claim 8 (currently amended): The method of claim 2, further comprising: 
defining a first equivalence metadata table comprising: 

a first row corresponding to the first metadata summary; 
a second row corresponding to the second metadata summary; 
a first column corresponding to the first metadata summary; and 
a second column corresponding to the second metadata summary, 
wherein the process of identifying the first and second documents as distinct if the 
attribute values of the list items of the first and second summary sub-trees are not 
equivalent comprises storing a zero binary value in the first row and second column 
position of the first equivalence metadata table summary . 

Claim 9 (currently amended): The method of claim 3, further comprising: 
defining a first equivalence metadata table comprising: 

a first row corresponding to the first metadata summary; 
a second row corresponding to the second metadata summary; 
a first column corresponding to the first metadata summary; and 
a second column corresponding to the second metadata summary, 
wherein the process of identifying the first and second documents as distinct if the text 
content of the list items of the first and second summary sub-trees are not equivalent 
comprises a storing of zero binary value in the first row and second column position of 
the first equivalence metadata table summary . 

Claim 10 (currently amended): A method for classifying electronically posted 
documents, the method comprising: 

receiving a plurality of documents; 

generating a respective plurality of metadata summaries corresponding to the 
plurality of received documents; 

grouping a first subset of the respective plurality of metadata summaries into a 
first summary group, the first summary group comprising summaries having a mime- 
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type designation; 

selecting a first metadata summary and a second metadata summary from the 
first summary group, wherein the first metadata summary includes a first summary sub- 
tree and the second metadata summary includes a second summary sub-tree and 
wherein a sub-tree includes a plurality of list items : 

comparing the structur e list items of the first summary sub-tree with the structure 
list items of the second summary sub-tree; and 

identifying the first and second documents as distinct if the s tructur e s list items of 
the first and second summary sub-trees are not equivalent. 

Claim 11 (currently amended): The method of claim 10 f wherein the step of grouping 
further comprises grouping a second subset of the respective metadata summaries into 
a second summary group, the second summary group comprising summaries having a 
second mime-type designation. 

Claim 12 (currently amended): A system for classifying electronically posted 
documents, the system comprising: 

a metadata parser module coupled to receive electronically posted documents, 
the metadata parser configured to output respective metadata summaries, wherein 
each respective metadata summary comprises one or more sub-trees, wherein a sub- 
tree includes a plurality of list items and wherein a list item includes at least one 
attribute and at least one attribute value comprising text content structures, one or mor e 
attribut e s, and content toxt ; 

a summary repository coupled to receive and store the respective metadata 
summaries; and 

a summary consolidator coupled to the summary repository, the summary 
consolidator configured to compare the list items of sub-trees, identify metadata 
summaries as distinct if the list items of metadata summaries are not equivalent and 
delete duplicate metadata summaries from the summary repository. 
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Claim 13 (currently amended): The system of claim 12, wherein the summary 
consolidator comprises: 

a sub-tree comparator configured to compare one or more sub-trees 
struoturos of the retrieved metadata summaries; 

an attribute comparator configured to compare the attribute values of the 
list items of the sub-trees of the retrieved metadata summaries; and 

a text comparator configured to compare the text content included within 
the attribute values of the list items of the sub-trees of the retrieved metadata 
summaries. 



Claim 14 (currently amended): The system of claim 13, wherein the sub-tree 
comparator is configured to compare the a metadata portion of the metadata summary. 

Claim 15 (currently amended): The system of claim 13, wherein the attribute 
comparator is configured to compare the attribute values included within the metadata 
portion of the metadata summary. 

Claim 16 (currently amended): The system of claim 13, wherein the text comparator is 
configured to compare the text content included within the metadata portion of the 
metadata summary. 

Claim 17 (currently amended): A program product for use in a computer system that 
executes program steps recorded in a computer-readable media to perform a method 
for classifying electronically posted documents, the program product comprising: 
a record-able media; 

a program of computer-readable instructions executable by the computer system 
to perform processes comprising: 

receiving a first document and a second document; 

generating a first metadata summary corresponding to said first document 
and a second metadata summary corresponding to the second document, wherein the 
first metadata summary includes a first summary sub-tree and the second metadata 
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summary includes a second summary sub-tree and wherein a sub-tree includes a 
plurality of list items : 

comparing the structure list items of the first summary sub-tree with the 
6 tructuR > list items of the second summary sub-tree; and 

identifying the first and second documents as distinct if the s tructur e s list 
items of the first and second summary sub-trees are not equivalent. 

Claim 18 (currently amended): The program product of claim 17, wherein tho first 
su mmary 6 ub - tr e o each list item includes at least one attribute having a first an attribute 
value, and wh e r e in tho sooond summary cub - tr ee i ncludes at least ono attribut e having 
a se cond attr i buto va l ue, the program product method further comprising the processes 
of: 

comparing for e ach of tho at l oast ono attr i but es , th e first and s e cond attributo 
values -the attribute value of a list item of the first summary sub-tree with the attribute 
value of a list item of the second summary sub-tree : and 

identifying the first and second documents as distinct if the attribute values of the 
first and second summary sub-trees are not equivalent. 

Claim 19 (currently amended): The program product of claim 18, wherein a list item of 
the first summary sub-tree includes text content, and wherein a list item of the second 
summary sub-tree includes text content, the program product further comprising the 
processes of: 

comparing the text included within the first and second summary sub-trees; and 
identifying the first and second documents as distinct if the text content of the 
first and second summary sub-trees are not equivalent. 

Claim 20 (currently amended): The program product of claim 19, further comprising the 
method step of identifying the first and second documents as duplicates if the text 
content within the list items of the first and second summary sub-trees are equivalent. 



AM9-99-074 7 09/513,058 



PACE 10/17 " RCVD AT 12/30/2003 11:54:45 AM [Eastern Standard Time] * SVR:USPTO-EFXRF-1f3 " DN1S:87203OB * CSID:56 1-989-98 12 * DURATION (mm-ss):08-10 



/ 

f 

r 

| 12/38/2003 11:54 

Response Under 37 CFR §1.111 

Claim 21 (currently amended): The program product of claim 20, further comprising the 
process of removing the second metadata summary from tho f i rst oummary group . 

Claim 22 {currently amended): The program product of claim 21, further comprising the 
processes of: 

defining a first equivalence metadata table comprising: 

a first row corresponding to the first metadata summary; 
a second row corresponding to the second metadata summary; 
a first column corresponding to the first metadata summary; and 
a second column corresponding to the second metadata summary, 
wherein the process of identifying the first and second documents as distinct if the text 
content of the list items of the first and second summary sub-trees are not equivalent 
comprises storing a zero binary value in the first row and second column position of the 
equivalence metadata summary. 

Claim 23 (currently amended): The method of claim 18, further comprising the 
processes of: 

defining the first equivalence metadata table comprising: 

a first row corresponding to the first metadata summary; 
a second row corresponding to the second metadata summary; 
a first column corresponding to the first metadata summary; and 
a second column corresponding to the second metadata summary, 
wherein the process of Identifying the first and second documents as distinct if the 
attribute values of the list items of the first and second summary sub-trees are not 
equivalent comprises storing a zero binary value in the first row and second column 
position of the equivalence metadata summary. 

Claim 24 (currently amended): The method of claim 19, further comprising the 
processes of: 

defining a first equivalence metadata table comprising: 

a first row corresponding to the first metadata summary; 
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a second row corresponding to the second metadata summary; 
a first column corresponding to the first metadata summary; and 
a second column corresponding to the second metadata summary, 
wherein the process of identifying the first and second documents as distinct if the text 
content of the list items of the first and second summary sub-trees are not equivalent 
comprises storing a zero binary value in the first row and second column position of the 
equivalence metadata summary. 

Claim 25 (currently amended): A program product for use in a computer system that 
executes program steps recorded in a computer-readable media to perform a method 
for classifying electronically posted documents, the program product comprising: 
a record-able media; 

a program of computer-readable instructions executable by the computer system 
to perform method steps comprising: 

receiving a plurality of documents; 

generating a respective plurality of metadata summaries corresponding to the 
plurality of received documents; 

grouping a first subset of the respective plurality of metadata summaries onto a 
first summary group, the first summary group comprising summaries having a mime- 
type designation; 

selecting a first metadata summary and a second metadata summary from the 
first summary group, wherein the first metadata summary includes a first summary sub- 
tree and the second metadata summary includes a second summary sub-tree and 
wherein a sub-tree includes a plurality of list items ; 

comparing the structur e list items of the first summary sub-tree with the structure 
list items of the second summary sub-tree; and 

identifying the first and second documents as distinct if the structur e s list items of 
the first and second summary sub-trees are not equivalent. 

Claim 26 (currently amended): The program product of claim 25, wherein the step of 
grouping further comprises tho stop of grouping a second subset of the respective 
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metadata summaries into a second summary group, the second summary group 
comprising summaries having a second mime-type designation. 
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